A Powerful Statistical Approach for Large-Scale Differential Transcription Analysis
نویسندگان
چکیده
Next generation sequencing (NGS) is increasingly being used for transcriptome-wide analysis of differential gene expression. The NGS data are multidimensional count data. Therefore, most of the statistical methods developed well for microarray data analysis are not applicable to transcriptomic data. For this reason, a variety of new statistical methods based on count data of transcript reads have been correspondingly proposed. But due to high cost and limitation of biological resources, current NGS data are still generated from a few replicate libraries. Some of these existing methods do not always have desirable performances on count data. We here developed a very powerful and robust statistical method based on beta and binomial distributions. Our method (mBeta t-test) is specifically applicable to sequence count data from small samples. Both simulated and real transcriptomic data showed mBeta t-test significantly outperformed the existing top statistical methods chosen in all 12 given scenarios and performed with high efficiency and high stability. The differentially expressed genes found by our method from real transcriptomic data were validated by qPCR experiments. Our method shows high power in finding truly differential expression, conservatively estimating FDR and high stability in RNA sequence count data derived from small samples. Our method can also be extended to genome-wide detection of differential splicing events.
منابع مشابه
A distinct numerical approach for the solution of some kind of initial value problem involving nonlinear q-fractional differential equations
The fractional calculus deals with the generalization of integration and differentiation of integer order to those ones of any order. The q-fractional differential equation usually describe the physical process imposed on the time scale set Tq. In this paper, we first propose a difference formula for discretizing the fractional q-derivative of Caputo type with order and scale index . We es...
متن کاملA limited memory adaptive trust-region approach for large-scale unconstrained optimization
This study concerns with a trust-region-based method for solving unconstrained optimization problems. The approach takes the advantages of the compact limited memory BFGS updating formula together with an appropriate adaptive radius strategy. In our approach, the adaptive technique leads us to decrease the number of subproblems solving, while utilizing the structure of limited memory quasi-Newt...
متن کاملA new virtual leader-following consensus protocol to internal and string stability analysis of longitudinal platoon of vehicles with generic network topology under communication and parasitic delays
In this paper, a new virtual leader following consensus protocol is introduced to perform the internal and string stability analysis of longitudinal platoon of vehicles under generic network topology. In all previous studies on multi-agent systems with generic network topology, the control parameters are strictly dependent on eigenvalues of network matrices (adjacency or Laplacian). Since some ...
متن کاملSemi-analytical Approach for Free Vibration Analysis of Variable Cross-Section Beams Resting on Elastic Foundation and under Axial Force
in this paper, free vibration of an Euler-Bernoulli beam with variable cross-section resting on elastic foundation and under axial tensile force is considered. Beam’s constant height and exponentially varying width yields variable cross-section. The problem is handled for three different boundary conditions: clamped-clamped, simply supported-simply supported and clamp-free beams. First, the equ...
متن کاملBank efficiency evaluation using a neural network-DEA method
In the present time, evaluating the performance of banks is one of the important subjects for societies and the bank managers who want to expand the scope of their operation. One of the non-parametric approaches for evaluating efficiency is data envelopment analysis(DEA). By a mathematical programming model, DEA provides an estimation of efficiency surfaces. A major problem faced by DEA is that...
متن کامل